Wordnet Based Lexicon Grammar for Polish
Abstract
In this paper we present a progress report on a long-term, ongoing project concerning a lexicon-grammar of Polish. It builds on our earlier research, which focused mainly on morphological dictionaries, text understanding and related tools. By lexicon-grammars we mean grammatical formalisms based on the idea that the sentence is the fundamental unit of meaning and that grammatical information should be closely tied to words. Organizing grammatical knowledge into a lexicon yields a powerful NLP tool, particularly well suited to supporting heuristic parsing. The project is inspired by the achievements of Maurice Gross, Kazimierz Polanski and George Miller. We present the current state of PolNet, a wordnet-like lexical network, with particular emphasis on its verbal component, which is now being converted into the kernel of a lexicon-grammar for Polish. We also present various aspects of PolNet development and validation within the POLINT-112-SMS project.
Related resources
Recent Advances in Development of a Lexicon-Grammar of Polish: PolNet 3.0
In this paper we present recent work contributing to the transformation of the initial PolNet, a Polish wordnet developed at the Adam Mickiewicz University, into a Lexicon Grammar of Polish. We focus on granularity issues that arose when verb-noun collocations were included, as well as on information related to language registers.
Towards a Lexicon-Grammar of Polish: Extraction of Verbo-Nominal Collocations from Corpora
In this paper we present a contribution to the long-term SyntLex project, which aims at a lexicon-grammar for Polish. We present a corpus-based method for computer-assisted improvement and/or verification of verbo-nominal lexicon-grammars, applied to Polish. The paper is a feasibility study.
Class-Based Construction of a Verb Lexicon
We present an approach to building a verb lexicon compatible with WordNet but with explicitly stated syntactic and semantic information, using Levin verb classes to systematically construct lexical entries. By using verb classes we capture generalizations about verb behavior and reduce the effort needed to construct the lexicon. The syntactic frames for the verb classes are represented by a Lex...
A Large Wordnet-based Sentiment Lexicon for Polish
The applications of plWordNet, a very large wordnet for Polish, do not yet include work on sentiment and emotions. We present a pilot project to annotate plWordNet manually with sentiment polarity values and basic emotion values. We work with lexical units, plWordNet’s basic building blocks. So far, we have annotated about 30,000 nominal and adjectival LUs. The resulting lexicon is already one...
A CCG-based Approach to Fine-Grained Sentiment Analysis
In this paper, we present a Combinatory Categorial Grammar (CCG) based approach to the classification of emotion in short texts. We develop a method that makes use of the notion put forward by Ortony et al. (1988), that emotions are valenced reactions. This hypothesis is central to our system, in which we adapt contextual valence shifters to infer the emotional content of a text. We integrate...